Overview

Dataset Statistics

Number of Variables 3
Number of Rows 5000
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 1470
Duplicate Rows (%) 29.4%
Total Size in Memory 117.3 KB
Average Row Size in Memory 24.0 B
Variable Types
  • Numerical: 2
  • Categorical: 1

Dataset Insights

Dataset has 1470 (29.4%) duplicate rows Duplicates
test_result has constant length 1 Constant Length

Variables


age

numerical

Approximate Distinct Count 69
Approximate Unique (%) 1.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 80000
Mean 51.609
Minimum 18
Maximum 90
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age is skewed right (γ1 = 0.2209)

Quantile Statistics

Minimum 18
5-th Percentile 34
Q1 43
Median 51
Q3 60
95-th Percentile 71
Maximum 90
Range 72
IQR 17

Descriptive Statistics

Mean 51.609
Standard Deviation 11.287
Variance 127.3964
Sum 258045
Skewness 0.2209
Kurtosis -0.3613
Coefficient of Variation 0.2187
  • age has 3 outliers

physical_score

numerical

Approximate Distinct Count 404
Approximate Unique (%) 8.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 80000
Mean 32.7603
Minimum 0
Maximum 50
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • physical_score is skewed left (γ1 = -0.779)

Quantile Statistics

Minimum 0
5-th Percentile 17.295
Q1 26.7
Median 35.3
Q3 38.9
95-th Percentile 42.7
Maximum 50
Range 50
IQR 12.2

Descriptive Statistics

Mean 32.7603
Standard Deviation 8.1698
Variance 66.7457
Sum 163801.3
Skewness -0.779
Kurtosis -0.2034
Coefficient of Variation 0.2494
  • physical_score is not normally distributed (p-value 0.001555869631343877)
  • physical_score has 12 outliers

test_result

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 330000

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 5000
  • The top 2 categories (1, 0) take over 50.0%
  • test_result has words of constant length

Interactions

Correlations

Missing Values